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(54) Apparatus and method for providing a generic interface between a host system and an 
asynchronous transfer mode core functional block 



(57) A generic Input/Output interface between an lO 
block and a System and ATM Layer Core on a network 
interface circuit is provided. The GIO interface includes 
parallel DMA read and write control liandshake signal 
lines; parallel DMA read and write data handshake sig- 
nal lines which operate independently from the read and 
write control handshake signal lines; parallel DMA read 
and write data signal lines; and a single clock signal line. 



GIO interface facilitates maximum utilization of the lO 
bandwidth, and allows several requests to be queued 
across the GIO interface at the same time, in each read 
and write direction. In addition, the GIO interface utilizes 
a fixed clock for driving the transmit and receive data 
path. By thus referencing all transactions to a clock driv- 
ing the Core, the Core remains unchanged for different 
embodiments of the network interface circuit which in- 
terface to different host computer systems and busses. 




FIG. 3 
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Description 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention relates generally to the field 
of computer systems and, in particular, to a method and 
apparatus for providing a generic input/output interface 
to any Input/Output bus between a host computer sys- 
tem and an Asynchronous Transfer IVIode (ATM) net- 
work interface Core functional blocl<. 

2. Description of the Related Art 

ATM technology is emerging as the preferred tech- 
nology for sending infornnation at very high speeds be- 
tween a transmission point and one or more receiving 
points. An ATIVI system facilitates the transmission of 
data over such a network by defining a set of "data con- 
nections" in which each connection represents a virtual 
circuit having a particular source and destination and an 
associated data transmission rate. 

One particular implementation of an ATM system, 
shown in Figure la, employs a Cell Interface block con- 
figured to implement a Universal Test and Operational 
Physical Interface (UTOPIA) protocol. A network inter- 
face circuit typically cooperates with the cell interface 
block to transfer data between the host computer and 
the other computers in the network. For transmission of 
cells to the ATM Cell Interface, multiple packets of data 
are accessed over an Input/Output (lO) Bus and provid- 
ed to a System and ATM Layer Core (Core) where the 
data is segmented and then transmitted to the ATM Cell 
Interface (I.e., Utopia), For transmission of packets to 
the host device, the Cell Interface block first receives 
cells from the ATM Cell Interface (Utopia) and then for- 
wards the cells to the Core for reassembly into packets 
before transferring the packets to the host device or an- 
other local area network 

In the implementation of Figure la, all transactions 
within the ATM Core functional block are driven by the 
System 10 bus clock. Since the ATM Cell Interface (Uto- 
pia) receives data from the ATM core at the same lO 
clock, the link rate of the device data transmission be- 
tween the host system and the ATM Cell Interface (Uto- 
pia) is unnecessarily limited. 

In addition, the Core in this conventional circuit can 
only Interact with a specific System lO bus, i.e., the Sys- 
tem lO bus it is configured to interface with, since the 
Core is driven by the System 10 bus clock. For example, 
if an interface is configured to interact with an S-Bus, it 
will have to be redefined for supporting a faster bus such 
as the PCI bus. This results in inflexibility and overall 
inefficiency since the Core is then dependent on the 
configuration of the lO block it is interfaced to. In addi- 
tion, this conventional arrangement does not provide for 
pipelining of the Core transmit load and receive unload 
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DMA requests. Furthermore, in using this arrangement, 
the Core has to wait for the TX FIFO to be completely 
full or for the RX FIFO to be completely empty before 
transmitting data or receiving data to or from the media. 

s As a result, the 10 bus bandwidth is not fully utilized. 

Figure 1 b illustrates another conventional approach 
in providing an interface between a host system and an 
ATM Core functional block. This conventional circuit at- 
tempts to solve the problem existing in the circuit of Fig- 

70 ure la by providing synchronization of the signals driving 
the Core to the clock driving the lO bus, as shown in 
Figure lb. However, this approach Is complicated since 
all control handshake, data handshake and the data 
transfer rates have to be correlated to the lO bus clock. 

'5 In addition, it utilizes clock synthesizing logic to provide 
a fast enough clock to Utopia so as not to limit link rate. 
Moreover, this approach restricts the application of the 
Core to a specific 10 bus. Like the circuit of Figure la, 
this conventional circuit does not provide for pipelining 

20 of the Core transmit load and receive unload DMA re- 
quests. In addition, this conventional approach, the 
Core has to wait for the TX FIFO to be completely full 
or for the RX FIFO to be completely empty before trans- 
mitting data or receiving data to or from the media. As 

25 a result, the lO bus bandwidth is not fully utilized with 
this approach either 

Accordingly there Is an additional need in the tech- 
nology to provide a system for synchronizing the 10 
clock and the Core clock which facilitates ease of sys- 

30 tern design and Core logic re-usabillty, and which pro- 
vides for pipe lining of the Core transmit load and receive 
unload DMA requests, so that utilization of the lO band- 
width can be maximized. 

35 BRIEF SUMMARY OF THE INVENTION 

A generic Input/Output interface between an 10 
block and a System and ATM Layer Core on a network 
interface circuit Is provided. The GIO Interface Includes 

40 parallel DMA read and write control handshake signal 
lines; parallel DMA read and write data handshake sig- 
nal lines wh ich operate independently from the read and 
write control handshake signal lines; parallel DMA read 
and write data signal lines; and a single clock signal line. 

45 The design and definition of the GIO interface facil- 
itates maximum utilization of the lO bandwidth, for ex- 
ample, by allowing back-to-back DMA requests over the 
lO bus. It also allows several requests to be queued 
across the GIO interface at the same time. In each read 

50 and write direction, and permits the Core to read or write 
data in increments rather than having to wait for the 
DMA buffer to be completely filled or emptied out. 

In addition, the GIO interface utilizes a fixed clock 
for driving the transmit and receive data path. The clock 

55 is used for the entire Core and lO synchronization is 
pushed over to the lO block. By thus referencing all 
transactions to the clock driving the Core, the Core re- 
mains unchanged for different embodiments of the net- 
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work interface circuit which interface to different host 
computer systems and busses. In addition, when the 
Core is 622 IVlbps duplex-ready, it can be coupled to any 
lO blocl< to achieve a transmission data rate of up to 622 
Mbps duplex. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1a illustratGS one conventional approach in 
providing an interface between a host system and an 

ATM Core functional block. 

Figure 1 b illustrates another conventional approach 
in providing an interface between a host system and an 
ATM Core functional block. 

Figure 2 illustrates an exemplary computer system 
network incorporating an asynchronous transfer mode 
network interface circuit which utilizes the method and 
apparatus of data transfer coordination of the present 
invention. 

Figure 3 is an overall system diagram illustratingthe 
architecture of the asynchronous transfer mode network 
interface circu it wh ich utilizes the method and apparatus 
of data transfer coordination in accordance with one em- 
bodiment of the present invention. 

Figures 4a is a detailed block diagram illustrating 
the System-IO interface 20, the Generic Input/Output 
("GIO") Interface 24, and the Core 22 of Figure 3, in ac- 
cordance with one embodiment of the present invention. 

Figure 4b is a block diagram which illustrates in de- 
tail the signal lines shown in Figure 3a. 

Figures 5a and 5b are timing diagrams illustrating 
the timing cycles for the read and write control hand- 
shake signals between the System-IO Interface 20 and 
the Core 22, respectively. 

Figures 5c and 5d are flow charts illustrating the 
read control handshaking processes shown in Figures 
5a and 5b respectively. 

Figures 6a and 6b are timing charts illustrating the 
timing cycles of the read data handshake signals shown 
in Figures 4a and 4b. 

Figures 6c and 6d are flow charts illustrating the 
read and write data handshake processes shown in Fig- 
ures 5a and 5b. 

DETAILED DESCRIPTION OF THE INVENTION 

Figure 2 illustrates an exemplary computer system 
network incorporating the ATM network interface circuit 
which utilizes the method and apparatus of data transfer 
protocol of the present invention. The computer system 
network 1 0 includes host computer systems (not shown) 
which incorporate one or more of the ATM network in- 
terface circuits (NIG) 12. The NICs 12 are coupled 
through a local ATM switch 14 to a public ATM switch 
1 6 to enable asynchronous transfer of network data be- 
tween host computer systems coupled to the network 
10. Alternately, the NICs 12 can be coupled directly to 
the public ATM switch 16. As shown in Figure 1, the 
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computer system network 10 may also include compu- 
ter systems which incorporate the use of a Local Area 
Network ("LAN") emulation 15 which serves as a gate- 
way for connecting other networks such as Ethernet or 
5 token ring networks 17 which utilize the ATM network 
as a supporting framework. 

Figure 3 is a simplified system diagram illustrating 
the architecture of the ATM NIC 12 which utilizes the 
method and apparatus of data transfer coordination in 
10 accordance with one embodiment of the present inven- 
tion. The ATM NIC 12 interfaces a host computer system 
48 coupled through System-IO bus 38 to the ATM Cell 
Interface 40 operating in accordance with the ATM pro- 
's The ATM NIC 12 shown includes a System-IO in- 
terface 20, a Generic Input/Output ("GIO") interface 24, 
a System and ATM Layer Core 22, a Local Slave inter- 
face 26, a transmit (TX) FIFO 28, a receive (FIX) FIFO 
30, a Cell Interface block 32, an External Buffer Memory 
20 Interface 34 and a clock synthesis circuit 36. 

Together, the elements 20-36 of NIC 12 cooperate 
to transfer data between the host computer 48 and the 
other computers in the network through multiple, dy- 
namically allocated channels in multiple bandwidth 
25 groups. Collectively, the elements of the network inter- 
face circuit 12 function as a multi-channel Intelligent di- 
rect memory access (DMA) controller coupled to the 
System-IO bus 38 of the host computer system 48. In 
one embodiment, multiple transmit and receive chan- 
ge nels are sen/iced as virtual connections utilizing a full 
duplex 155/622 Mbps (Mega bits per second) physical 
link. Multiple packets of data, subscribed to different 
channels are accessed over the System-IO bus 38 to 
the external buffer memory 42, via the External Buffer 
35 Memory Interface 34, are segmented by the Core 22 into 
transmit cells for transmission to the ATM Cell Interface 
40 through Cell Interface block 32, The Core 22 also 
comprises reassembly logic to facilitate reassembly of 
the received cells to packets. 
40 The TXand RX buffers, such as TX and RX FIFOS 
28 and 30, coupled between the Core 22 and the Cell 
Interface block 32, are used to stage the transmit and 
receive ATM cells of the transmit and receive packets 
respectively. The Cell Interface block 32 transmits and 
45 receives cells to or from the ATM Cell Interface 40 of the 
network, driven by clock signals provided by Clock Syn- 
thesis Circuit 36, Preferably the ATM Cell Interface 40, 
and therefore the Cell Interface block 32, conforms to 
the Universal Test and Operations Physical Interface for 
so ATM ("UTOPIA") standard, as described by the ATM Fo- 
rum specification. Thus, in one preferred embodiment, 
the ATM Cell Interface 40 is UTOPIA. To conform to the 
UTOPIA specification, the clock synthesis circuit 36 pro- 
vides either a clock signal of 20 MHz or 40 MHz to en- 
ss able the Cell Interface block 32 to support an 8 bit stream 
at 20 MHz for 155 Mbps or a 16 bit stream at 40 MHz 
for a 622 Mbps data stream. 

In the present embodiment, the Cell Interface block 
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32 rsceives 52-byte data cells each having a 4-byte cell 
header and a 48-byte payload from the TX Fl FO 28. The 
Cell Interface block 32 inserts a header checksum as a 
fifth byte to the cell header Into each cell prior to provid- 
ing the 53-byte data cell to the ATM Cell Interface 40. 
Conversely, when the Cell Interface block 32 receives 
cells from the ATM Cell Interlace 40, it examines the 
header checksum in the fifth byte of each ceii to deter- 
mine if the checl^sum is correct. If so, the byte repre- 
senting the checl<sum is stripped from the cell and the 
52 byte data cell is forwarded to the RX FIFO 30, other- 
wise the entire cell is dropped. 

The System-IO Interface 20 and GIO interface 24 
insulate the host computer system 48 from the specifics 
of the transfer to the ATM Cell Interface 40. Further- 
more, the Core 22 is Insulated from the specifics of the 
system bus 38 and host specifics. In the present pre- 
ferred embodiment, the System Bus is an S-Bus, as 
specified in the Institute of Electronics and Electrical En- 
gineers ("IEEE") standard 1496 specification. The Sys- 
tem-IO Interface 20 is configured to communicate in ac- 
cordance with the specifications of the System-IO bus 
38, in the present illustration, the S-Bus, which transfers 
data in 32-bit and 64-bit formats. It is contemplated that 
the System-IO Interface 20 can be configured to con- 
form to different host computer system busses. The 
System-IO Interface 20 is also configured to transfer 
and receive data in accordance with the protocols spec- 
ified by the GIO interface 24. 

The GIO interface 24 provides a singular interface 
through which the Core 22 communicates with the host 
computer. Thus, the Core 22 does not change for differ- 
ent embodiments of the NIC 12 which interface to dif- 
ferent host computer systems and busses. The GIO in- 
terface 24 utilizes a 40 MHz clock provided by clock syn- 
thesis circuit 36 for driving the transmit and receive data 
path so as to obtain a 622 Mbps full duplex operation. 
The 40 MHz clock is used for the entire Core 22 and lO 
synchronization is pushed over to the lO block (not 
shown) in the System-IO Interface 20. By thus fixing the 
clocl< driving the Core 22, the Core 22 remains fixed for 
different embodiments of the NIC 12 and only the Sys- 
tem-IO Interface 20 has to be altered for interfacing with 
a different System-IO bus 38. 

Three memory sub-systems are associated with the 
operation of the NIC 12. These include the host memory 
49 located in the host computer system 48, the external 
buffer memory 42 external to the NIC 12 and storage 
block 44 located in the Core 22. The NIC 12 manages 
two memory areas: the external buffer memory 42 and 
the storage block 44. The external buffer memory 42 
contains packet data for all transmit and receive chan- 
nels supported by the Nl C 1 2. The storage block 44 con- 
tains DMA state information for transmit and receive 
channels and pointers to data structures in host memory 
49 for which DMAtransfers are performed. The storage 
block 44 also contains the data structure specifics to 
manage multiple transmit and receive buffers for pack- 
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ets in transition between the host 48 and the ATM Cell 
Interface 40. 

The host computer system 48 includes host mem- 
ory 49 which contains data packets and pointers to the 

5 packets being transmitted and received. As noted pre- 
viously, the NIC 12 also shields the cell delineation de- 
tails of asynchronous transfer from the applications run- 
ning on the host computer system. For present purpos- 
es, it is assumed that software running on the host com- 

70 puter system 48 manage transmit and receive data us- 
ing wrap around transmit and receive rings with packet 
interfaces as is well known in the art. 

Figures 4a and 4b are detailed block diagrams illus- 
trating the System-IO interface 20, the GIO interface 24, 

'5 and the Core 22 of Figure 3, in accordance with one em- 
bodiment of the present invention. As shown, the Sys- 
tem-IO interface 20 includes a System-IO/ATM 
(SB_ATI\^) block 48 which consists of two 14-location 
deep, 64-bit wide wrap-around FIFOs - a Read (RD) 

20 FIFO 50 and a Write (WR) FIFO 52. The RD FIFO 50 
and WR FIFO 52 respectively support more than one 
64-byte read and write data bursts requested by the 
Core 22, which eventually result In read and write ac- 
cesses across the System-IO bus 38. 

25 The Core 22 includes an ATM_SYS block 54 which 
initiates simultaneous read and write requests via 
TX_ATMSYS circuit 56 or RX_ATMSYS circuit 58, to the 
SB_ATM blocl^ 48. When thus initiated, the SB_ATM 
block 48 performs arbitration to select the next request 

30 to honor. Read and write addresses, sizes and targets 
are separately provided by the Core 22 at the same time 
the request is issued. The addresses, sizes and targets 
can be dispatched as quickly as one clock cycle from 
the time they were issued, as will be discussed in detail 

35 In the following sections. 

The GIO interface 24 of the present invention com- 
prises 4 groups of signal lines as shown in Figure 4a, 
namely (1) the parallel DMA read and write control 
handshake signal lines 60; (2) the parallel DMA read 

40 and write data handshake signal lines 62 which operate 
independently from the read and write control hand- 
shake signal lines 60; (3) the parallel DMA read and 
write data signal lines 64; and (4) a single clock signal 
line 66. The circuits for generating and receiving the sig- 

45 nals 60 are disclosed in co-pending U.S. Patent Appli- 
cation No. 08/499,199 titled "A Method and Apparatus 
for Allowing Packet Data to be Separated Over Multiple 
Bus Targets" filed on July 7, 1 995 by the same inventors 
and assigned to the same assignee as the present ap- 

50 plication. The subject matter in U.S. Patent Application 
No. 08/499,199 is hereby incorporated by reference. 
The circuits for generating and receiving the signals 62 
are disclosed in co-pending U.S. Patent Application No. 
08/498,618 titled "Method and Apparatus for Dynami- 

55 cally Calculating Degrees of Fullness of a Synchronous 
FIFO" filed on July 7, 1 995, which was assigned to the 
same assignee as the present application. The subject 
matter in U.S. Patent /Application No. 08/498,618 is 
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hereby incorporated by reference. The circuits for gen- 
erating and receiving signals 64 and 66 may be imple- 
mented by one skilled in tlie art. 

The parallel DMA read and write control handshake 
signals provided via signal line 60 permit simultaneous 
read and write DMA requests to be sent to the SB_ATM 
block 48 in the System-IO Interface 20 from the 
ATM_SYS block 54. This in turn permits the SB_ATM 
block 48 to perform arbitration to allow one of the re- 
quests to be asserted on the System-IO bus 38 at a time. 

The paraiiei DMA read and write data handshake 
signals provided via signal line 62 are synchronous to 
the Core 22 clock domain and indicate in real time how 
full or empty the RD FIFO 50 or WR FIFO 52 is, rather 
than indicating if the FIFOs 50 or 52 are totally full or 
totally empty This finer granularity of flags aiiow faster 
data flow across the GIO interface 24. The data hand- 
shake signals 62 are independent of the control hand- 
shake signals 60 and thus permit smaller DMA requ ests, 
such as descriptor updates, to be provided along with 
the larger (64 byte) DMA requests, resulting in improved 
throughput. 

The DMA read and write data is provided via a du- 
plex, 32-bit signal line 64 to preserve the full bandwidth 
of the external buffer memory interface 34 so as to fa- 
cilitate duplex 622 Mbps segmentation and reassembly 
A single clock signal is provided via signal line 66. In 
one embodiment, this clock signal is 40 MHz. 

Figure 4b illustrates in detail the four signal lines of 
Figure 4a. The parallel DMA read and write control 
handshake signal lines 60 include the rd_req, the 
rd_ack, the rd_address, the rdjarget, the rd_size, the 
rd_done, the wr_req, the wr_ack, the wr_address, the 
wrjarget and the wr„size signal lines. The rd_req and 
wr_req signal lines provide signals from the ATM_SYS 
block 54 to the SB_ATM block 48, and the signals thus 
provided respectively initiate a read and write request 
on the rising edge of the signal. The rd_ack and wr_ack 
signal lines each provide signals from the SB_ATM 
block 48 to the ATM_SYS block 54, which are asserted 
to indicate that the rd_req and the wr_req signals are 
respectively granted and the ATM_SYS block 54 is free 
to continue its functions and may change the read or 
write address, size, or targets thereafter. 

The rd_address and the wr_address signal lines 
each provide signals from the ATM_SYS block 54 to the 
SB_ATM block 48, which indicate the memory address- 
es to be used during the read and write transactions re- 
spectively The rdjarget and wrjarget signal lines pro- 
vide signals from the ATM_SYS 54 to the SB_ATM block 
48, which indicate whether the lO bus target device for 
the read and write request, respectively, is the slave 
card or the host memory. The host memory and slave 
card are both DMA bus target devices. The slave card 
is an S-bus or lO bus device coupled to the System lO 
bus 38, which sen/es as a slave. One embodiment of 
the slave card is described in co-pending U.S. Patent 
Application No. 08/499,199 titled "A Method and Appa- 



ratus for Allowing Packet Data to be Separated Over 
Multiple Bus Targets" filed on July 7, 1995 by the same 
inventors and assigned to the same assignee as the 
present application. The subject matter in U.S. Patent 

5 Application No. 08/499,199 is hereby incorporated by 
reference. When set to 0, it indicates that the target de- 
vice is the host memory, when reset to 1 , it indicates that 
the target device is the slave card. The rd_size and 
wr_size signal lines provide signals from the ATM_SYS 

10 54 to the SB_ATM block 48 which indicate the raw size 
in byte units, of the data required by the read and write 
request, respectively Finally, the rd_done signal line 
provides a signal from the SB_ATM block 48 to the 
ATM_SYS block 54 to indicate to the ATM_SYS block 

'5 54 that the last word or doubleword has been written 
into the RD FIFO 50 and that the ATM_SYS block 54 
can read off the rest of the data without looking at the 
Fl FO flag. This is to prevent the last word(s) of data from 
lingering in the RD FIFO 50 if there is no new DMA read 

20 scheduled for updating the FIFO flags. 

The parallel DMA read and write data handshake 
signals provided via signal line 62 include the flags, the 
atm_rd_en, atm_wr_en, last_rd and last_wr signal lines. 
The flags signal line includes an 

25 at_ieast_x_wordsJilled flag, where x is an integer, to 
the ATM_SYS block 54 that it can start reading dataf rom 
RD FIFO 50 if desired. The flags signal line also includes 
an atJeast_x_words_empty signal line to signal 
ATM_SYS block 54 that it can start writing data into WR 

30 FIFO 52 if desired. These flags are mclk-based. The 
atm_rd_en and atm_wr_en signal lines respectively as- 
sert a signal to signify to the RD FIFO 50 and WR FIFO 
52 that it is reading and writing data, respectively The 
duration of each of the signal indicates how many 32-bit 

35 words are to be written. However, there is no restriction 
on the number of words to be written. The iast_rd and 
last_wr signals are provided by the SB_ATM block 48 
across the GIO interface 24 to allow 64 bit-wide 10 block 
data path implementation. Where on a 32-bit wide data 

40 path is implemented, the last_wr and last-rd signal lines 
can be left unconnected. 

Figures 5a and 5b are timing diagrams illustrating 
the timing cycles for the read and write control hand- 
shake signals between the System-IO Interface 20 and 

45 the Core 22. Figures 5c and 5d are flow charts illustrat- 
ing the read control handshaking processes shown in 
Figures 5a and 5b respectively As shown in Figure 5a, 
the rd_req signal is asserted via the rd_req signal line 
when there is a request to read data. Since the GIO In- 

50 terface 24 has to service multiple read and write re- 
quests, fair arbitration of these requests has to be per- 
formed to allow one of the requests to be asserted on 
the System-IO bus 38 at a time. The apparatus and 
method described in pending U.S. Patent /application 

55 No. 08/499,199 titled "A Method and Apparatus for Al- 
lowing Packet Data to be Separated over Multiple Bus 
Targets" filed on July 7, 1995, which was assigned to 
the assignee of the present invention, and which is in- 
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corporated herein, describes one embodiment of such 

arbitration. 

In the event that a read request is honored upon 
completion of arbitration, the rd_acl< signal is asserted, 
as shown in Figure 5a. The rdjarget, rd_address and 
rd_size signals from the ATM_SYS blocl< 54 are regis- 
tered with the SB_ATI\/1 bloclc 48 one clock cycle after 
rd_req Is asserted. This Is done so that the ATM_SYS 
block 54 can then be released to process other requests 
without waiting for or be held up by the acknowledgment 
of read or lO arbitration. The read cycle is initialed in the 
meantime. At the time the first good byte or word is writ- 
ten into the RD FIFO 50, the rd_acksignal is deasserted. 
This allows the ATM_SYS block 54 to assert another 
rd_req, if there is one pending. When reading is done, 
the rd_done signal is asserted bytheSB_ATMblock48. 

The read control handshake process will now be de- 
scribed with reference to Figure Sc. Beginning from a 
start state, the process SI 00 proceeds to decision step 
S102, where it queries if a read request has been hon- 
ored. Ifnot, the process SI 00 proceeds back to decision 
step SI 02, to continue monitoring whether a read re- 
quest has been honored. If a read request has been 
honored, the process SI 00 proceeds to process step 
SI 04 where the rd_ack signal is asserted. At this junc- 
ture, the process SI GO proceeds in two parallel opera- 
tions, represented by processes SlOOa and SlOOb. Ad- 
vancing from process step S104, process SlOOa pro- 
ceeds to process step 8106, where the read cycle is in- 
itiated. It then proceeds to decision step SI 08 where it 
queries If reading has been completed. If not, the proc- 
ess SI 00a proceeds back to decision step S 1 08. If read- 
ing has been completed, the process SlOOa proceeds 
to process step S110, where the rd_done signal is as- 
serted. The process SI 00a then terminates. 

Advancing from process step SI 04, the process 
SI 00b proceeds to process step S112, where the 
rdjarget, rd_address and rd_size signals from the 
ATM_SYS block 54 are transferred and registered with 
the SB_ATM block 48 one clock cycle after rd_req is as- 
serted. The process SI 00b then proceeds in process 
step S1 1 4, where the rd_ack signal is deasserted when 
at the first good byte or word is written into the RD FIFO 
50. This allows the ATI\/I_SYS block 54 to assert another 
rd_req, if there is one pending. The process SI 00b then 
terminates. 

In the event that a write request is honored upon 
completion of arbitration, the wr_ack signal is asserted, 
as shown in Figure 5b, The wrjarget, wr_address and 
wr_size signals from the ATM_SYS block 54 are regis- 
tered with the SB_ATM block 48 one clock cycle after 
the wr_req is asserted. The ATM_SYS block 54 can then 
be released to process other requests without waiting 
for or be held up by the wr_ack signal or lO arbitration. 
The write cycle is initiated in the meantime. At the time 
the first good byte or word is written into the WR FIFO, 
the wr_ack signal is deasserted. This allows the 
ATM_SYS block 54 to assert another wr_req, if there is 



one pending. 

The write control handshake process S120 will now 
be described with reference to Figure 5d, Beginning 
from a start state, the process SI 20 proceeds to deci- 

5 sion step S122, where it queries if a write request has 
been honored. If not, the process SI 20 proceeds back 
to decision step SI 22 to continue monitoring if a write 
request has been honored. If a request has been hon- 
ored, the process SI 20 proceeds to process step SI 24, 

10 wherethewr_acksignalisasserted. Atthe juncture, the 
process SI 20 proceeds in two parallel operations, rep- 
resented by processes SI 20a and SI 20b. Advancing 
from process step SI 24, the process SI 20a proceeds 
to process step SI 26, where the write cycle is initiated, 

15 The process SI 20a then proceeds to decision step 
SI 28, where it queries if writing has been completed. If 
not, the process S120a proceeds back to decision step 
SI 28 to continue monitoring for the completion of the 
writing process. If writing has been completed, the proc- 

20 ess SI 20a terminates. 

Advancing form process step SI 24, the process 
SI 20b proceeds to process step SI 30, where the 
wrjarget, wr_address and wr_size signals from the 
ATM_SYS block 54 are transferred and registered with 

25 the SB_ATM block 48 one clock cycle after the wr_req 
is asserted. The ATM_SYS block 54 can then be re- 
leased to process other requests without waiting for or 
be held up by the wr_ack signal or 10 arbitration. The 
process SI 20b then proceeds to process step SI 32, 

30 where the wr_ack signal is deasserted at the time the 
first good byte or word Is written into the WR FIFO. This 
allows the ATM_SYS block 54 to assert another wr_req, 
if there is one pending. The process SI 20 then termi- 
nates. 

35 Figures 6a and 6b are timing charts illustrating the 
timing cycles of the read and write data handshake sig- 
nals shown in Figures 4a and 4b. Figures 6c and 6d are 
flow charts illustrating the read and write data hand- 
shake processes shown in Figures 4a and 4b. 

40 A discussion of the read data handshake cycle will 
now be discussed. As shown in Figure 6a, the 
at_least_x_words Jilled flag Indicates at every mclk cy- 
cle the number (i.e., "x", where x is 2 words, 4 words, 
etc.) of storage locations in the RD Fl FO 50 that are fi I led 

45 with valid data. Similarly, the at_least_x_words_empty 
flag indicates to ATM_SYS block 54 the number of emp- 
ty storage locations in the WR FIFO 52 at every mclk 
cycle. The ATM_SYS block 54 determines when to as- 
sert the atm_rd_en signal based on either the assertion 

50 of the at_least_x_wordsJilled flag or the assertion of 
the rd_done signal. As shown, the ATM_SYS block 54 
may assert the atm_rd_en signal as desired, so long as 
the at_least_x_words Jilled flag is asserted. When the 
at_least_x_wordsJilled flag is deasserted in the tenth 

55 mclk cycle, the atm_rd_en signal is deasserted in the 
following (i.e., eleventh) mclk cycle. However, the as- 
sertion of the rd_done signal overrides the state of the 
at_least_x_wordsJilled flag. As shown, when the 
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rd_clone signal is asserted during the eleventh mclk cy- 
cle, when the at_least_x_words_fiiied flag Is deassert- 
ed, the atm_rd_en signal is asserted in the following (i. 
e., twelfth) mclkcycle. In this manner, the rd_done signal 
may be used for the last few words in the RD FIFO 50 
to be read, when there is no more System-IO bus 38 
read activity to bring new words so that the 
at_least_x_words_filled flag would not be asserted. 

The read data handshake process will now be dis- 
cussed with reference to Figure 6c. Beginning from a 
start state, the process SI 50 proceeds to process step 
SI 52, it determines whether to assert the atm_rd_en 
signal based on either the assertion of the 
at_least_x_words_filled flag or the assertion of the 
rd_done signal. If the at_least_x_words_filled flag is as- 
serted, the process SI 50 proceeds to decision step 
SI 54, where queries if the atm_rd_en signal should be 
asserted. If so, the process SI 50 proceeds to process 
step SI 56, where the atm_rd_en signal is asserted, and 
the process S150then returns to decision step 8152. If 
the atm_rd_en is to be deasserled, the process SI 50 
proceeds to process step S158, where the atm_rd_en 
signal is deasserted, upon which the process returns to 
decision step S 152. 

If the rd_done signal has been asserted, the proc- 
ess SI 50 proceeds to process step SI 760, where the 
atm_rd_en signal is asserted. In this manner, the 
rd_done signal may be used for the last few words in 
the RD FIFO 50 to be read, when there is no more Sys- 
tem-IO bus 38 read activity to bring new words so that 
the at_least_x_words_filled flag would not be asserted. 
The process SI 50 then terminates. 

A discussion of the write data handshake cycle will 
now be discussed. As shown in Figure 6b, the 
at_least_x„words„empty flag indicates to ATM_SYS 
block 54 the number of empty storage locations in the 
WR FIFO 52 at every mclk cycle. The ATM_SYS block 
54 determines when to assert the atm_wr_en signal 
based on the assertion of the at_least_x_words_empty 
flag. As shown, the ATM_SYS block 54 may assert the 
atm_wr_en signal as desired, so long as the 
at_least_x_words_empty flag is asserted. When the 
at_least_x_words_empty flag is deasserted in the tenth 
mclk cycle, the atm_wr_en signal is deasserted in the 
following (i.e., eleventh) mclk cycle. Since writing is un- 
der the control of the Core 22, and the System-IO inter- 
face 20 will eventually empty out the entire WR FIFO 
52, no lingering words will be left in the WR FIFO 52. As 
a result, there is no need to utilize a wr_done signal. 

The write data handshake process will now be dis- 
cussed with reference to Figure 6d. Beginning from a 
start state, the process SI 70 proceeds to decision step 
SI 72, where it queries if the at_least_x_words_empty 
flag is asserted. If so, the process S170 proceeds to 
process step SI 74, where the atm_wr_en signal is as- 
serted, upon which the process SI 70 returns to decision 
step SI 72. If the at_least_x_words_empty flag is deas- 
serted, the process SI 70 proceeds to process step 
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SI 76, where the atm_wr_en signal is deasserted. The 
process S170 then returns to decision step S172 to fur- 
ther monitor the at_least_x_words_empty flag 

The design and definition of the GIO interface 24 

5 provides maximum utilization of the lO bandwidth, spe- 
cifically, by allowing the System-IO bus 38 to accommo- 
date back-to-back DMA requests and to support the 
Core 22 segmentation and reassembly functions. In ad- 
dition, such definition would affect the level of pipe lining 

10 of the Gore 22 transmit load and receive unload DI\^A 
requests. It also allows 2 or 3 requests to be queued 
across the GIO Interface 24 at the same time in each 
read and write direction. In addition, the GIO interface 
24 permits the Core 22 to read or write data in incre- 

'5 ments rather than having to wait for the RD FIFO 50 to 
be completely full or for the WR Fl FO 52 to be complete- 
ly empty The duplex data path width of 32 bits each per 
read and write direction matches the data width of the 
external buffer memory interface 34 (Figure 3), so that 

20 no bandwidth is lost across the GIO interface 24. In ad- 
dition, the definition of a fixed Core 22 clock ensures 
that the Core 22 remains unchanged for different em- 
bodiments of the NIC 1 2 which interface to different host 
computer systems and busses. The last_rd and last_wr 

25 signals provide a means for 32-bit, 64-bit, 96-bit, 128-bit, 
etc. lO data path implementation without placing any re- 
strictions on the lO bus bit width. 

IVIodifications and variations of the embodiments 
described above may be made by those skilled in the 

30 technology while remaining within the true scope and 
spirit of this invention. Thus, although the present inven- 
tion has been described in terms of certain preferred 
embodiments, other embodiments that will be apparent 
to those of ordinary skill in the technology are also within 

35 the scope of this invention. Accordingly, the scope of the 
invention is intended to be defined only by the claims 
which follow. 



40 Claims 

1 . A method for coordinating data transfer between a 
System Input/Output (lO) functional block and an 

Asynchronous Transfer Mode (ATM) functional 
45 block, comprising the steps of: 

(a) providing a first signal from the ATM func- 
tional block to the System lO functional block, 
for requesting the reading of data from the Sys- 

50 tern lO functional block to the ATM functional 

block; 

(b) providing locational information related to 
the data to be transferred, from the ATM func- 

55 tional block to the System lO functional block: 

and 

(c) providing a second signal from the System 
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10 functional block to the ATM functional block, 
for acknowiedging the request for reading of 
data, wherein step (b) is Independent from step 

(c) . 

2. The nfiethod of Claim 1 , further comprising the steps 
of: 

(d) providing a third signal from the ATM func- 
tional block to the System 10 functional block, 
for requesting the writing of data from the ATM 
functional block to the System 10 functional 
block; and 

(e) providing locational information related to 
the data to be transferred, from the ATM func- 
tional block to the System 10 functional block; 
and 

(f) providing a fourth signal from the System lO 
functional block to the ATM functional block, for 
acknowledging the request for writing of data, 
wherein step (e) is independent from step (f), 
and wherein steps (a)-(c) are Independent from 
steps (d)-(f). 

3. The method of Claim 1 , further comprising the steps 
of: 

(d) reading dataf rom the System lO functional 
block and providing the data to the ATM functional 
block, wherein the step of reading data Is asynchro- 
nous from steps (a) -(c). 

4. The method of Claim 3, further comprising the step 

(e) writing data located in the ATM functional 
block, to the System 10 functional block, wherein 
the step of writing is provided In parallel to step (d). 

5. The method of Claim 1 , wherein the step (b), the 
locational information includes information related 
to a target device for the request, a memory address 
from which to retrieve the data and a size of the data 
to be transferred. 

6. The method of Claim 2, wherein the System 10 func- 
tional block is synchronous to a first clock, the ATM 
functional block Is synchronous to a second clock 
wh Ich has a clock rate that Is different from the clock 
rate of the first clock and wherein the first, the sec- 
ond, the third and the fourth signals are synchro- 
nous to the second clock. 

7. The method of Claim 1 , further comprising the step 
of providing a last read signal from the ATM func- 
tional block to the System lO functional block, to in- 
dicate that the last word of data has been read from 
the System lO functional block. 
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8. The method of Claim 2, further comprising the step 
of providing a last write signal from the ATM func- 
tional block to the System lO functional block, to In- 
dicate that the last word of data has been written to 

5 the System 10 functional block. 

9. An interface for transferring data, comprising: 

a System 10 functional block having a first buff- 
10 er and a second buffer; and 

an ATM functional block which provides a first 
signal to the System lO functional block, for re- 
questing the reading of data from the System 
IS 10 functional block to the ATM functional block, 

the ATM functional block also providing loca- 
tional information related to the data to be 
transferred to the System lO functional block; 

20 the System 10 functional block providing a sec- 

ond signal to the ATM functional block, for ac- 
knowledging the request for reading of data, 
wherein locational Information is provided Inde- 
pendently from provision of the second signal. 

25 

10. The interface of Claim 9, wherein the ATM function- 
al blockf urther provides a third signal to the System 
lO functional block, for requestingthe writing of data 
from the ATM functional block to the System lO 

30 functional block, the ATM functional block also pro- 
viding locational Information related to the data to 
be transferred, to the System lO functional block, 
and wherein the System lO functional block further 
provides a fourth signal to the ATM functional block, 

3S for acknowledging the request for writing of data, 
wherein locational information is provided Inde- 
pendently from provision of the fourth signal and 
wherein the first signal is provided asynchronously 
with the third signal. 

40 

11. The interface of Claim 9, wherein the ATM function- 
al blockf urther reads data from the System 10 func- 
tional block, and wherein reading of the data is 

asynchronous to the provision of the first signal. 

45 

12. The interface of Claim 11, wherein data is written 
from the ATM functional block to the System 10 
functional block, and wherein the writing of data is 
provided in parallel to the reading of data. 

50 

13. The interface of Claim 9, wherein the locational in- 
formation includes information related to a target 
device for the request, a memory address from 
which to retrieve the data and a size of the data to 

55 be transferred. 

14. The interface of Claim 10, wherein the System lO 
functional block is synchronous to a first clock, the 
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ATM functional block is synchronous to a second 
clock which has a clock rate that is different form 
the clock rate of the first clock, and wherein the first, 
the second, the third and the fourth signals are syn- 
chronous to the second clock. 

15. The interface of Claim 9, wherein the ATM function- 
al block further provides a last read signal from the 
ATM functional block to the System lO functional 
block, to indicate that the last word of data has been 

read from the System 10 functional block. 

16. The interface of Claim 10, wherein the ATM func- 
tional block fu rther p rovides a last write signal to the 
System 10 functional block, to indicate that the last 
word of data has been written to the System lOfunc- 
tional block. 

17. A System for transferring data in an Asynchronous 
Transfer Mode network, comprising: 



a host processor; 

a memory coupled to the host processor; 

a System input/output bus coupled to the host 
processor; and 

an interface comprising: 

a System lO functional block coupled to the 
System input/output bus, the System lO func- 
tional block having a first buffer and a second 
buffer; 



and wherein the System lO functional block further 
provides a fourth signal to the ATM functional block, 
for acknowledging the request for writing of data, 
wherein locational information is provided inde- 
pendently from provision of the fourth signal and 
wherein the first signal is provided asynchronously 
with the third signal. 

19. The System of Claim 1 7, wherein the ATM function- 
al blockfurther reads data from the System 10 func- 
tional block, and wherein reading of the data is syn- 
chronous to the provision of the first signal. 

20. The interface of Claim 18, wherein data is written 
from the ATM functional block to the System lO 
functional block, and wherein the writing of data is 
provided in parallel to the reading of data. 

21 . The Interface of Claim 1 8, wherein the locational in- 
formation Includes Information related to a target 
device for the request, a memory address from 
which to retrieve the data and a size of the data to 
be transferred. 

25 22. The interface of Claim 18, wherein the System 10 
functional block is synchronous to a first clock, the 
ATM functional block is synchronous to a second 
clock which has a clock rate that is different from 
the clock rate of the first clock, and wherein the first, 

30 the second, the third and the fourth signals are syn- 
chronous to the second clock. 



an ATM functional block coupled to the System 
10 functional block, the ATM functional block 
providing a first signal to the System lO func- 
tional block, for requesting the reading of data 
from the System lO functional block to the ATM 
functional block, the ATM functional block also 
providing locational Information related to the 
data to be transferred to the System 10 func- 
tional block; and 



the System 10 functional block providing a sec- 
ond signal to the ATM functional block, for ac- 
knowledging the request for reading of data, 
wherein locational information is provided inde- 
pendently from provision of the second signal, so 

18. The System of Claim 1 7, wherein the ATM function- 
al block further provides a third signal to the System 
lO functional block, for requesting the writing of data 
from the ATM functional block to the System lO ss 
functional block, the ATM functional block also pro- 
viding locational information related to the data to 
be transferred, to the System lO functional block, 
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